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Abstract 

Approximately 1% of the known protein structures display knotted configurations in their native 
fold but their function is not understood. It has been speculated that the entanglement may 
inhibit mechanical protein unfolding or transport, e.g., as in cellular threading or translocation 
processes through narrow biological pores. Protein knot manipulation has become accessible in 
single molecule experiments, e.g., leading to knot tightening and localization. Here we investigate 
tight peptide knot (TPK) characteristics in detail by pulling selected 3i and 4i-knotted peptides 
using all-atom molecular dynamics computer simulations. We find that the 3i and 4i-TPK lengths 
are typically Al ~ 4.7 nm and 6.9 nm, respectively, for a wide range of tensions {F < 1.5 nN), 
pointing to a pore diameter of ~ 2 nm below which a translocated knotted protein might get 
stuck. The 4i-knot length is in agreement with recent AFM pulling experiments. Detailed TPK 
characteristics however, may be sequence- specific: we find a different size and structural behavior 
in polyglycines, and, strikingly, a strong hydrogen bonding and water molecule trapping capability 
of hydrophobic TPKs due to side chain shielding of the polar TPK core. Water capturing and 
release is found to be controllable by the tightening force in a few cases. These mechanisms 
result into a sequence-specific 'locking' and metastability of TPKs what might lead to a blocking 
of knotted peptide transport at designated sequence-positions. Intriguingly, macroscopic tight 4i- 
knot structures are reproduced microscopically ('figure-of-eight' vs. the 'pretzel') and can be tuned 
by sequence in contrast to mathematical predictions. Our findings may explain a function of knots 
in native proteins, challenge previous studies on macromolecular knots, and may find use in bio- 
and nanotechnology. 

Keywords: protein knot, molecular dynamics simulation, AFM pulling, buried water, side-chain 
shielding, hydrogen bonds 
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I. INTRODUCTION 



After the discovery of the first knotted structure in the native fold of a protein in 1994 [l[ , 
additional studies [2!, Ill, and in particular a recent survey identified almost three hundreds 
of further knotted proteins, constituting around 1% of known structures in the protein data 
base Most of them have the simplest 3i (trefoil) topology, only a few have been found 
to posses the more complicated 4i (figure-eight) and 52-types of prime knots. While the 
question of physiological relevance is still a matter of debate [1, H, [6| it has been proposed 
that the entangled structure might have profound effects on protein folding and (forced or 
mechanical) unfolding j3, 0,0,0], e.g., to inhibit translocation through biological membranes 
or to protect against degradation by proteasome threading. In this respect it is tempting to 
speculate that the steric blocking of narrow pathways by a localized or tightly pulled protein 
knot may have bio (techno) logical significance. Also relevant, cyclotides, a family of proteins 
that have a cyclic peptide backbone, strong biological activity, and high pharmaceutical 
potential, feature a tightly packed cystine knot in their interior [9]. The synthesis and design 
of artificially interlocked molecules has become possible in supramolecular chemistry with 
app lications in bio- or nanotechnology, e.g., as molecular receptors, locks, or machines [l^, 
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The study of tight knot characteristics in (bio) polymers has a long history of interest 



13|, IJJ, Il5[. More than 20 years 



as knots easily self-tie and localize in any long chain |12l . 
ago de Gennes argued that knots may self-tie in crystallizing or sheared polymer melts 
changing their macroscopic relaxation behavior [l6| . Possible self-tying mechanisms may 
be based on electrostatic repulsion 



171], entropic tightening in worm-like chains [18|, or 



localization of (flat) polymer knots either in confinement |19l . |20| . |21| ] or in bad solvent 



conditions 13j. Externally controlled manipulation and characterization of microscopic 



knots has become accessible experimentally by employing optical tweezer methods |22l . |23 
or atomic force microscopy (AFM) 0, |24| . From a theoretical perspective, scalin g argurn ents 
fisl . [water- free) quantum calculations [2^, or coarse computer simulations jl3l. l26l. 27, 




30| have been used. Almost exclusively, previous studies focused on homogeneous 



systems such as polyethylene, DNA, or actin filaments; only recently tight protein knots 
(TPKs) with specific, inhomogeneous sequences have been investigated by Sulkowska et al. 
using an (implicit solvent) Go-model [31|. Due to local geometry (e.g., side chain size or 



kinks in the peptide) and in strong contrast to previous findings |17l . |23| . |29| . l3d . |32| knot 
localization and diffusion was dominated by jumps of the knot's ends to specific peptide 
locations suggesting qualitatively different, sequence-dependent fluctuations of TPKs when 
compared to homopolymers. 

Although highly relevant for transport, translocation, and threading processes of knotted 
proteins the size of a TPK has not been determined before. For an educated guess, consider 
a rope of (contour) length Z,,, tie a knot in it and pull it tight. The end-to-end distance I is 
now reduced by A/ = ~ ^ which we refer to in the following as the tight knot length, i.e., 
the length of the rope involved in the 'open' knot [3^ . By dividing A/ by the rope thickness 
Z), we obtain the characteristic quantity 34 



(1) 



which is minimized by the tight knot conformation, and is A=10.1 and 13.7 for a 3i and 4i, 
respectively, for idealized hard-core ropes [3?] . Assuming now a typical peptide thickness of 
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the order of an atomic size, D ~ 0.35 — 0.5 nm, we expect Al = AD ^ 3.5 — 5.1 nm and 
~ 4.8 — 6.9 nm for tight 3i and 4i peptide knots, respectively. If, naively, the knot is further 
assumed to be a circle with diameter Al = 2ttR we infer that a typical TPK radius may be 
of i? ~ 0.6 — 1.1 nm. 

In this work, we take a more detailed look at TPKs by performing explicit-water molecu\a.r 
dynamics (MD) computer simulations 3^ of 3i and 4i-knots in selected polypeptides involv- 
ing up to 30 amino acids and systematically study their size and structural behavior. We 
find tight knot lengths Al ~ 4.7 ± 0.3 nm (involving 13 ± 1 amino acids) and ~ 6.9 ± 0.3 nm 
(19±1 amino acids) for the 3i and 4i, respectively, surprisingly constant for a wide range of 
stretching forces {F < 1.5 nN), and typical tight knot radii of gyration of Rg ~ 0.7 — 0.8 nm, 
all in the range of the macroscopic estimate (1). The 4i-TPK length is in agreement with re- 
cent AFM pulling experiments on the natively knotted bacterial phytochrome [36| . Detailed 
tight knot characteristics however, may be sequence specific, e.g., we find smaller knots, 
and different structural and stability behavior in the special case of polyglycines. Strikingly, 
TPKs have a strong water capturing and hydrogen bonding capability within their closely 
packed interior, which is sequence-specific and promoted by nonpolar side-chains. Buried 
water and long-lived intra-knot hydrogen bonds lead to surprisingly rigid and stable tight 
knots in free simulations on a ~ 100 ns time scale. Intriguingly, macroscopic tight 4i-knot 
structures are reproduced microscopically ('figure-of-eight' vs. a 'pretzel'-like configuration) 
but depend on peptide sequence in contrast to mathematical predictions of the tight 4i-knot 
structure which is the 'figure-of-eight' figure We predict strongly localized tight knots 
after peptide stretching and a preferential affinity towards regions with dominantly nonpo- 
lar side chains. We demonstrate that the accurate modeling of specific side chains and the 
aqueous environment is crucial for the full understanding of TPK characteristics. 



II. METHODS AND SYSTEMS 
A. MD simulations 

Our all-atom MD simulations are performed using the software package Amber9.0 with 
the ff03 force-field and TIP3P solvent [3^. Systems are maintained at a fixed pressure of 
P = 1 bar and temperature T = 300 K by coupling to a Berendsen barostat and Langevin 
thermostat, respectively. System sizes vary between ~ 4000 and A^ ~ 8000 atoms. Elec- 
trostatic neutrality is assured by additional Na^-counterions compensating the net peptide 
charge given at pH=7. The rectangular and periodically repeated simulation box has edge 
lengths Lj. ~ ~ 30 A, while in peptide stretching direction — 5.5 — 7.0 A. Electrostatic 
interactions are calculated by particle mesh Ewald summation and real-space interactions 
have a cut-off of 9 A. Polypeptides are generated using the Amber tleap tool. Knots are tied 
into them utilizing interactive MD (IMD) in VMD [38|: while a Langevin simulation of the 
peptide is running and visualized a force can be applied to selected fragments by using the 
computer mouse so that the peptide can be dragged by hand into a finally knotted configu- 
ration. Thereafter, the system is equilibrated for ~ 5 ns with Langevin dynamics, solvated 
with TIP3P water, and further equilibrated by a ^ 5 ns MD simulation. For peptide stretch- 
ing and loosening we utilize the Amber steered MD (SMD) tool: a constant pulling velocity 
of 0.1 A/ns (0.01 m/s) drives the first and last atom (in a distance /) of the peptide backbone 
in opposite directions and force-extension curves F{1) are calculated. Pulling is terminated 
after the mean force reaches ~1.5 nN, a value at which covalent bond breaking can occur 
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experimentally [2J]. Simulation snapshots are generated using VMD [381]. Hydrogen bonds, 
radii of gyration, and rms deviations are analyzed using the Amber ptraj tool. 



B. Systems 

3i and 4i-types of knots are investigated. To study the influence of amino acid type on the 
tight knot structure we opt for three different homopeptides: the hydrophobic polyleucine 
(sequence Lat^^), the partly hydrophilic and charged polyglutamic acid (E^^J, and the slim, 
amphiphilic polyglycine {Gisf^J. The peptides have a total number of A'"aa=21 and 30 amino 
acids for the 3i and 4i-knots, respectively. Furthermore, two randomly picked pieces from 
the knotted cores of the natively 3i-knotted YibK methyltransferase |39| and the 4i-knotted 
Class II ketol-acid reductoisomerase jiO] are considered to directly connect to naturally 
occurring protein knots. In the following we name the knotted peptides by knot type and 
sequence, e.g., '3iL' for a polyleucine trefoil and '4imix' for the 4i-knot in a mixed sequence. 
The different knotted peptide systems, their amino acid (aa) sequences and numbers N^^ 
are summarized in Tab. I. 

III. RESULTS AND DISCUSSION 
A. Tight knot size and structure 

A typical initial configuration of a 3i-knotted peptide in our simulation is shown in 
Fig. 1 a) where a snapshot of 3iG is sketched before pulling it tight. The end-to-end 
extension here is / ~ 25A. A tight knot situation for the same peptide is shown in Fig. 1 b) 
for a large stretching force of ~1.5 nN (/ ~ 45A). For an elastic peptide as considered in this 
study the final 'tightness' of the knot will naturally depend on the external stretching force 
F. The calculated force-extension curve, F{1), for 3iG is shown in Fig. 2 a) together with 
the data for 3iE and 4iL. We observe an overall monotonic nonlinear increase of the force. 
Fluctuations are moderate on that scale and have local standard deviations ranging from 
~ 20 pN up to ~ 50 pN. Note that we also plot F{1) of knot loosening, i.e., 'reverse pulling', 
showing no obvious hysteresis. This indicates that our systems are close to equilibrium at 
the chosen pulling rate of 0.1 A/ns. 

In order to determine the tight knot length A/(F) = lc{F) —1{F) an accurate estimate for 
the force-dependent contour length lc{F) of the unknotted peptide is needed. For this, we 
calculate the average amino acid length Ali^[F) by measuring the mean distance between 
neighboring backbone nitrogen atoms in short unknotted peptides, presented in the inset to 
Fig. 2 a): below a stretching force of around ~10 pN the length thermally fluctuates around 
Aiaa,{F) ~ 3.5 A, then rises quickly with force in a nonlinear fashion in the low-stretching, 
thermal regime (F ~ 10 - 150 pN) to eventually increase linearly in the high stretching 
regime F > 150 pN. At F = 1.5 nN a value of A/aa(-^) — 3.8 A is reached. From the slope 
b of the linear part we estimate the linear elastic modulus F = A/aa(-^ = 0)/6 ~ 42 nN, 
which is in agreement with AFM pulling experiments, where F ~ 50 ± 15 nN j4l|. This 
agreement is remarkable since MD force-fields are typically not benchmarked to be accurate 
at considered large tensions. 

In pulling experiments, rupture of some terminal bonds at the AFM tip can occur at 
forces of around Fi =200 pN [24], where we find A/aa(-^i) — 3.68 A, leading to contour 
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length estimates lc{Fi) = A^aaA/aa - 77.3 A and /c(-Pi) ~ 110.4 A for the trefoil and the 
4i peptides, respectively. Consequently, it follows that the tight knot lengths for the trefoil 
peptides are between Al{Fi) ~ 44.3 A (3iG) and ~ 49.8 A (3iL). The number of amino 
acids involved in the knot are thus riaa = A//A/aa — 13. For the 4i-knots three of the four 
values lie between Al ~ 68.4 A and 71.9 A (naa — 19) while for the polyglycine knot (4iG) 
we find A/(Fi) ~ 60.4 A (naa — 13), around 14% smaller. The lengths are summarized in 
Tab. I. A typical error of these values is given by the fluctuations of the F{1) curve and is 
roughly of amino acid size (±3 A). 

Let us know consider more intense stretching and study the knot lengths at a larger force 
F2 = 1 nN. A/aa(-^) increases to ~ 3.76 A giving rise to a slightly larger contour length for 
the unknotted peptides. Evaluating the particular knot lengths we observe that the knots 
shrink in size (while the whole peptide is more stretched) as could have been anticipated. 
Although the pulling force is substantially increased typically only one amino acid less is 
involved in a single knot, so that surprisingly the knot sizes vary only a few percent for a 
wide range of tensions. An exception however, are both of the polyglycine peptides: here 
the tightening effect is considerable and the final knot lengths are 20-25% smaller than those 
of the other studied peptides. All lengths are summarized in Tab. 1. 

The knot lengths of ~ 4.7 nm and ~ 6.9 nm for the 3i and 4i-knots, respectively, fall inside 
the range of prediction ([1]) indicating on a first glance that they are primarily determined 
by generic packing effects with an effective excluded volume thickness D similar for most of 
the peptides. Furthermore, macroscopic arguments roughly hold on the molecular scale. In 
contrast, hydrophobicity and hydrophilicity seem to have no direct influence on tight knot 
size in the considered force regime. Examining a bit closer the nature of the amino acid 
side chains supports this statement: while the glycine side chain is identical to just a single 
hydrogen atom, a typical residue with a few carbon atoms gives rise to a more difficult 
molecular arrangement close to or inside the tight knot. This presumably leads to the 20- 
25% smaller knots in the special case of polyglycine. For the latter we thus find a smaller 
effective thickness D ^ 3.7 A, while for the other peptides D ~ 4.6 — 5.0 A. Importantly, 
apart from the polyglycine, all 4i-TPK lengths are in agreement with recent AFM pulling 
experiments on the natively knotted bacterial phytochrome (36| . 

Illustrating simulation snapshots are shown in Fig. 3 where we plot tight knot situations 
for the peptides 4iG and 4imix including their side chains. Large side chains obviously 
impede tight peptide packing. We also calculate the radius of gyration Rg of the knots, i.e., 
by averaging root mean square (rms) atomic distances from the geometric center of the atoms 
involved in the knot, i.e., constituting the length Al. We measure Rg ~ 7.2 ± 0.2 A and 
Rg ~ 7.8 ± 0.2 A for the 3i and 4i knots, respectively, with only weak dependence on the 
stretching force for all considered peptides apart from the polyglycine. For the latter radii 
of gyration are found to be close to the values above for weak stretching {F ~ 200 pN) but 
20% smaller for strong stretching (F > 1 nN). These TPK sizes are larger or comparable 
as the size of biological channels such as in the protease 0, so that a translocation of a 
knotted protein would indeed be blocked by the presence of a knot. 

An intriguing topological feature appears when inspecting the overall 4i-knot structure 
without the obscuring side chains, as in Fig. 1 c) and d). While the 4iG knot is figuratively 
indeed in a figure-eight configuration 4iL displays a 'pretzel'-like configuration. Actually, 
we find that all the considered 4i-knots expect from 4iG prefer the pretzel when inspected 
by eye. This comes as a surprise as the tight 4i-knot configuration which minimizes (1) has 
been shown to be figuratively the 'figure-of-eight', at least using simplifying mathematical 
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assumptions [3J]. Presumably the reasons are rather physics-based, i.e., high friction or 
lowering of the system free energy due more favorable amino acid interactions and packing 
around the knot region may result into the pretzel. Interestingly, the pretzel-like configu- 
ration can be a stable 4i-configuration in macroscopic knots under tension, e.g., as can be 
easily self-demonstrated using a simple computer cable, or as taught in books on cowboy 
rope tricks, see Fig. 4. 



B. Water trapping, hysteresis, and hydrogen bonding 

A striking structural feature we observe is the capability of some peptide knots to capture 
and strongly bind water molecules in their interior. The simulated peptides show this effect 
with varying magnitude, i.e., we find no bound water in polyglycine (3iG and 4iG) and 
the mixed peptide 3imix for any simulated peptide extension, while in 3iE a single trapped 
water molecule is reproducibly found only in the case of very close peptide packing at high 
forces F >1 nN. We find stronger water binding qualities for the other four peptides 3iL, 
4iE, 4iL, 4imix for a wider range of simulated peptide extensions, i.e., water was bound for 
simulation times of the order of ~10-100 ns per peptide pointing to a quite stable mechanism. 
On a first glance surprisingly, both homopeptides with the purely hydrophobic leucine side 
chains show the strongest water trapping capability. 

Simulation snapshots are shown in Fig. 5 for the peptides 4iE and 4imix: the water bonds 
to the backbone amides in the knot interior, involving at least three hydrogen bonds per 
molecule, and is rotationally immobilized. Apparently the water binding is made possible 
by the tight peptide packing in the highly bent knot allowing for multiple bonds of a water 
molecule to the polar backbone. A particularly interesting case is the water binding in 3iL. 
Here the bound water molecule is squeezed out of the knot interior for large stretching forces 
F ~1 nN. This behavior leads to a strong force peak in the force extension curve as shown 
in Fig. 2 b): for extensions / < 30 A the water molecule is bound as shown in the left 
snapshot. At / ~ 30 A and F ~ 1 nN the bound water is 'wrung' out and the force drops 
significantly before further increasing. When the knot is loosened F{1) shows a considerable 
hysteresis. However, a water molecule is captured by the knot again during loosening at 
extensions / < 27 A and F ~ 200 pN. Repeating the stretching-loosening loop twice shows 
quantitative reproducibility of this effect [cf. Fig. 2 b)]. The occurrence of the hysteresis 
points to the fact that the water binding-unbinding events fluctuate on large time-scales 
and this simulation deviates therefore from equilibrium. The magnitude of the hysteresis 
can be estimated by integrating over the F[l) stretching-loosening cycle which gives rise to 
a large dissipation energy of about AG ~ 30 — 35 k-^T, indeed comparable to the energy 
of 3-4 hydrogen bonds between a water molecule and a peptide environment (8-10 /cbT per 
hydrogen bond) (43| . 

It is a well-known fact that buried water molecules constitute an integral part of many 
native protein structures contributing to stability, flexibility, folding, and mechanical and 
enzymatic function 44, 45, 4^, 47|. Noteworthy, our measured AG is very close the binding 



enthalpy of a buried water molecule in the polar pocket of bovine pancreatic trypsin inhibitor 
(BPTI) [48], where four hydrogen-bonds constitute AH ~ 36 ksT. We find a similar large 
dissipation energy in peptide 4imix (AG ~ 20A;bT) and less pronounced in 4iE (AG ~ 
12/cbT) and 3iE (AG ~ 5/csT) due to partial water hydrogen binding events during knot 
tightening. No hysteresis is found in 4iL as water is bound here during the full stretching- 
loosening loop without any binding/unbinding transition. 
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Interestingly, in the sequence 3imix we find no water trapped in the knot interior for all 
peptide extension in contrast to 4imix, where we observe water bound on a ~ 10 ns time 
scale with three binding/ unbinding events for tension F < 500 pN. A closer inspection of the 
MD trajectory reveals that the immediate surrounding of the buried water molecule consists 
of 6 amino acids, ALD FQS, which create a mostly hydrophobic environment, see Fig. 5 b). 
This observation and the strong water binding capabilities of the polyleucines indicate that 
nonpolar side chain environments promote water hydrogen bonding to the tightly packed 
polar backbone. We explain this by the textbook-fact that hydro gen bonds are generally 
stronger in a nonpolar and/or deso/uatec? protein environment (isl . 49| . where electrostatic 



interactions are only weakly screened. We suspect that additionally the hydrophobic side 
chains impose a large energy barrier for a possible escape of a water molecule. In the 
polyglutamic acids water screening and the (probably lower) barrier is likely to be provided 
by the methylene groups of the side chains immediately surrounding the knotted peptide 
region. A nonpolar environment is clearly absent in the polyglycines. However, strong 
water trapping capability seems to results from a unique and delicate combination of local 
backbone structure and a specific, but rather nonpolar amino acid side chain environment. 

Related to this, another consequence of the tight peptide packing as further revealed by 
our simulations is the existence of long-lived hydrogen bonds between particular backbone 
amide groups. For instance during the ~ 200 ns stretching and loosening loop of polyleucine 
4iL we find that amino acids 10 and 24 hydrogen bond for ~ 80% of the simulation time. 
Detailed analysis yields similar behavior for the other peptides, i.e., intrapeptide hydrogen 
bonds are stable on a long ~10-100 ns time scale. An exception is polyglycine, where the 
longest hydrogen bond life expectancy is found to be one or two orders of magnitude shorter. 



C. Free simulations and tight knot stability 

We also conduct free simulations of the knotted peptides without any constraints in order 
to check whether the knots dissolve on a typical simulation time scale. Initial configurations 
are taken from a stretched situation with F ~ 200 pN. Dissolution of a knot is loosely defined 
here by connecting the peptide ends with an imaginary line and looking whether we find a 
knot in the closed loop or not. Only both of the polyglycine knots, 3iG and 4iG, show strong 
fluctuations and unknot quickly on a time scale of ~10 ns. All other investigated knots do 
not dissolve in a ~ 120 ns simulation pointing to a (meta)stable tight knot situation. For 
quantifying this we measure the rms deviation from the initial structure and find values of 
~ 2 A increasing quickly within ~ 10 ns to ~ 7 A for the polyglycines. For the other 
peptides however, the rms value stays at ~ 2 A for the total simulation time, supporting 
the observation that apart from the polyglycines tight knots stay stable and quite rigid after 
peptide stretching on relatively long time scales. We note that dissolution of the polyglycine 
proceeds rather via a knot 'swelling' than a 'slithering' mechanism (where the knot stays 
tight and diffuses to the end), possibly to relax the highly bent backbone. This needs not 
to be in contrast to study [l8| where an entropic tightening and slithering was predicted as 
this might be the dominant mechanism for somewhat 'looser' tight knots. 

As in the constrained case closer inspection of the knot structure reveals a few long-lived 
hydrogen bonds in all stable knots. A representative illustration is shown in Fig. 5 c) where 
we plot a MD snapshot of 4imix after a ~ 120 ns free simulation. Four hydrogen bonds are 
found between amide backbone groups right at the knot's ends clearly inhibiting the opening 
of the knot. Typically, we find that these hydrogen bonds persist on average ~ 100 ns (80- 
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90%) of the total free simulation time, even slightly longer for the polyleucines. In 3imix 
(where no buried water could be detected) the longest hydrogen-bond life time is shorter 
50-60 ns). Remarkably, in 3iL and 4iL additionally one water molecule was trapped 
during the full, unconstrained simulation, constituting a remarkable total of 7-8 long-lived 
hydrogen bonds within the knots. Again the high quantity and persistence strength of 
hydrogen bonds in 3iL and 4iL must be attributed to the desolvated, strongly nonpolar side 
chain environment of the tight knot. 



IV. CONCLUDING REMARKS 

In summary, our MD study of TPKs have revealed some generic, but also unexpectedly 
specific behavior and provoke some interesting speculations and future prospects: 

As previously conjectured, the steric blocking of narrow pathways by a localized or tightly 
pulled knot might be possible in vivo and also relevant for biotechnological purposes. In 
this light interesting are our findings that TPKs exhibit an unexpectedly strong stability 
and their radii of gyration are all indeed slightly bigger (~7-8 A) at moderate stretching 
{F < 200 pN) than the radius of the protease pore (~6.5 A) jl, 12]. We predict that a 



translocated knotted protein should get stuck in pores with a diameter below ~ 2 nm. 

Interesting for further investigation not only from a topological point of view [50| is the 
observation that most 4i-knots are figuratively not in a figure-eight but rather in a 'pretzel'- 
like configuration, which might be a (meta)stable configuration in 'physical' open tight knots 
in contrast to those underlying simplifying mathematical assumptions S^] • The pretzel may 



be stuck by high molecular friction, e.g., caused by hydrogen bonds, and/or preferred by 
the lowering of the system free energy due to favorable amino acid arrangements. 

As a striking result we find that the TPK interior has a strong water binding and hydrogen 
bonding capability which is promoted in rather nonpolar side-chain environments. These 
mechanisms result in 'locking' of the knot structure and surprisingly stable and rigid tight 
knots after peptide stretching in unconstrained MD simulations on a ~ 100 ns time scale. 
The observed quantitative reproducibility of squeezing-out and capturing a water molecule 
at well-defined tensions may allow for an external mechanical control of the capturing and 
releasing of single water molecules by designed peptide knots. Important in this respect is 
that buried water is known to be an integral part of native protein structures and not onh 



affects protein flexibility and folding [45|, |46|, |48| but can be essential for catalytic action [4 



Furthermore, TPKs might resemble structural elements of the cyclotide protein family - 
constituted by short (~ 30 aa) cyclic peptides with a tight cystine knot - which has strong 
potential in drug design In view of their structural complexity and close relation to 
native phenomena (engineered) TPKs thus might serve as an important model system for a 
deeper understanding of protein folding and stability [51], and enzymatic activity, and may 
be useful for pharmaceutical purposes due to a possible catalytic function. 

Finally, we would like to encourage further experiments in this stimulating field which are 



readily available using AFM [36j or optical tweezer methods [22, l23[ . Desirable are studies 
on protein knot length and size, stability, and diffusion behavior along stretched peptides 
and the refolding of knotted proteins after stretching. Particularly interesting is knotted 
peptide translocation for probing the mechanical forces involved in protein unfolding, knot 
tightening and blocking, e.g., by threading them through narrow biological or solid-state 



nanopores j52j. Buried water molecules in peptide knots might be detectable by nuclear 



magnetic relaxation dispersion methods (NMRD) [45|] allowing for another experimental 
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method to explore TPK fluctuations and energy landscapes. 
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TABLE I: Simulated knotted peptide systems. The peptides have A'^aa amino acids (aa) with shown 
sequence. 1^ is the estimated contour length of the unknotted peptide, / the measured end-to-end 
distance of the knotted peptide, and A.I the tight knot length involving Waa amino acids. The 
lengths are evaluated at a stretching force of Fi=200 pN and F2=l nN. 



12 



FIG. 1: MD simulation snapshots of different protein knots in a 'cartoon' representation, a) 
Initial configuration of peptide 3iG, where Z ~ 25 A is the end-to-end distance, b) Tight knot 
configuration of peptide 3iG. The end-to-end distance is I ~ 45.0 A at a stretching force F ~ 
1.5 nN. c) A tight 'figure-eight' knot configuration of peptide 4iG at F ~1 nN. d) A tight 'pretzel' 
knot configuration of peptide 4iL at F ~1 nN. 



FIG. 2: a) Force (F)-extension (Z) curves for the peptides 3iE, 3iG, and 4iL. Stretching curves 
(solid black lines) and loosening curves (dashed red lines) lie on top of each other indicating a small 
hysteresis. Pulling rate is 0.1 A/ns. The inset shows the mean distance A/aa between neighboring 
backbone nitrogen atoms vs. stretching force F in an unknotted peptide, b) Force-extension curves 
for the polyleucine 3iL. Black lines correspond to stretching while red lines correspond to loosening 
of the knot. While stretching, for extensions Z < 30 A a single water molecule is permanently 
trapped by the polar backbone of the peptide knot, see left snapshot. When exceeding ~ 30 A the 
water molecule is squeezed out (right snapshot) giving rise to a significant peak in the force- 
extension stretching curve. This transition leads to a considerable hysteresis when stretching and 
loosening curves are compared. The effect is reproducible when the stretching-loosening loop is 
repeated (dashed lines). 



FIG. 3: MD simulation snapshots of the tightly knotted 4iG (a) and 4imix (b). The backbone is 
sketched in a yellow ribbon for better identification and all amino acids are resolved in a 'licorice' 
representation (F ~ 1 nN). 
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FIG. 4: Cowboy rope trick from B. S. Mason's book 53|] showing a 4i-knot with a 'pretzel'-hke 
structure (left) and a 'figure-of-eight' structure (right). Fascinatingly, we find that both configu- 
rations are stable in proteins microscopica lly and depending on protein sequence. The book says 
instructively "The figure-of-eight is tied by precisely the same movements as t he pretzel. The 
factor determining which knot will result is the way the knot is dropped after shaking the rope off 
your arm. Jerk it as it falls and you should have the figure-of-eight. Shake it off gently and the 
pretzel should result." (Taken from http://www.inqui ry.net/outdoor/spin_rope/trick_knots.htm.) 



FIG. 5: MD simulation snapshots of water trapped in peptide 4iE (a) and 4imix (b) at a force F ~ 
1 nN. The backbone is shown in ribbon structure and only those residues are sketched ('licorice' 
representation) which are actively involved in water binding. Water (red and white spheres) is 
hydrogen bonded to the backbone amides, c) MD snapshot of 4imix in an unconstrained MD 
simulation. Four long-lived hydrogen bonds between backbone amides at the knot's ends are 
explicitly drawn (dotted blue lines). 
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